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ABSTRACT 

Translation of a gene is assumed to be efficient 
if the supply of the tRNAs that translate it is high. 
Yet high-abundance tRNAs are often also at high 
demand since they correspond to preferred 
codons in genomes. Thus to fully model transla- 
tional efficiency one must gauge the supply-to- 
demand ratio of the tRNAs that are required by the 
transcriptome at a given time. The tRNAs' supply is 
often approximated by their gene copy number in 
the genome. Yet neither the demand for each 
tRNA nor the extent to which its concentration 
changes across environmental conditions has 
been extensively examined. Here we compute 
changes in the codon usage of the transcriptome 
across different conditions in several organisms by 
inspecting conventional mRNA expression data. 
We find recurring dynamics of codon usage in the 
transcriptome in multiple stressful conditions. In 
particular, codons that are translated by rare 
tRNAs become over-represented in the transcrip- 
tome in response to stresses. These results raise 
the possibility that the tRNA pool might dynamic- 
ally change upon stress to support efficient trans- 
lation of stress-transcribed genes. Alternatively, 
stress genes may be typically translated with low 
efficiency, presumably due to lack of sufficient evo- 
lutionary optimization pressure on their codon 
usage. 

INTRODUCTION 

Organisms have evolved means to tune the translational 
efficiency of their genes to different desired levels. 
Facilitating this mode of regulation is the redundancy of 
the genetic code — synonymous codons are translated to 
the same amino acid, but their corresponding tRNAs 
might differ by their amounts in cells. Common 



measures of translational efficiency assess genes by either 
measuring the correlation between their codon usage 
pattern to that of selected 'elite' highly expressed genes 
(1,2), or by an explicit weight of the availability of 
tRNAs that translate them (3). One Such measure is the 
tRNA adaptation index, tAI (4), which deduces the abun- 
dance of the various tRNAs from their gene copy number 
(GCN) in the genome. The tAI measure predicts with rea- 
sonable accuracy both mRNA and protein levels (5,6). 

Yet, recent studies suggest that models of translational 
efficiency should be more comprehensive [reviewed in (7); 
(8,9)]. In particular, the concentration of the various 
tRNAs can vary to different extents between conditions 
and tissues (9-13) and so is their base modification and 
amino acid loading (14-17). In addition codon usage was 
shown to vary between tissues (18). Toward a comprehen- 
sive model of translational efficiency we focus here on an 
unexplored important consideration — the supply-to- 
demand ratio of the different tRNAs that translate 
mRNAs. 

Specifically, we explore the potential changes in the 
demand for each tRNA, namely the abundance of the 
corresponding codon(s) in the transcriptome at various 
biological conditions. We suggest a simple means to 
mine mRNA expression data in order to explore the 
dynamic codon usage across conditions and species. We 
find that under stress conditions the demand for tRNAs 
by certain codons increases, whereas the extent of repre- 
sentation of other codons in the transcriptome decreases. 
Interestingly the codons whose representation increases 
the most in the transcriptome upon stress correspond to 
tRNAs that are represented by the lowest GCNs in the 
genome in each of the examined species. This situation 
suggests two possible explanations: the supply for these 
codons increases too in stress, e.g. by increased production 
of the corresponding tRNAs, or that stress-related genes 
remain relatively poorly translated. We constructed a 
simulated evolution model that uses two minimalist as- 
sumptions: that stress genes are expressed infrequently 
during evolution, and that the supply of tRNAs in the 
cell is limited. Results from the simulation are consistent 
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with the hypothesis that limited codon adaptiveness of the 
stress genes results from lack of evolutionary constraint to 
optimize them and due to the limitation in the supply of 
tRNAs. Reassuringly the simulation reproduces codon 
usage differences observed in genomes. We thus suggest 
that the stress transcriptome remains poorly translated 
due to compromised translational efficiency of the stress 
genes. 



original tAI measure, and translational efficiency of 
individual codons (originally defined by dos Reis 
et al. as the 'absolute adaptiveness value 1 , W t ). Briefly, 
Wi defines the adaptiveness of an individual codon by 
the availability of the tRNAs that serve in translating it, 
incorporating both the fully matched tRNA, as well as 
tRNAs that contribute to translation through wobble 
rules (25-27). Formally, the 'absolute adaptiveness 
value' for the z'-th codon is 



MATERIALS AND METHODS 

Data sources 

The affymetrix platform of micro-arrays was used to 
allow absolute, i.e. non-relative, measurements of 
mRNA abundance. Normalized mRNA abundances for 
Saccharomyces cerevisiae following oxidative stress and 
DNA damage by methyl methanesulfonate (MMS) were 
downloaded from (19) and in heat shock, oxidative stress 
and osmotic stress from (20). Normalized mRNA abun- 
dances of S. cerevisiae during a 1 5 days process of wine 
fermentation (21) were downloaded from GEO (Gene 
Expression Omnibus, www.ncbi.nlm.nih.gov/geo), GEO 
accession: GSE8536. Normalized mRNA abundances of 
Schizosaccharomyces pombe during a short-term response 
to nitrogen starvation (22) were downloaded from 
ArrayExpress (http://www.ebi.ac.uk/microarray-as/ae/) 
under accession E-TABM-784. Transcriptome profiling 
data for Caenorhabditis elegans in response to oxidative 
stress (23) were downloaded from GEO under accession 
number GSE9301. The tRNA GCNs of species were 
downloaded from the Genomic tRNA Database (http:// 
lowelab.ucsc.edu/GtRNAdb/), (24). 

Generation of condition-dependent demand matrices 

We created condition-dependent 'Codon-Expression' 
matrices for various environmental conditions. A 
'Codon-Expression' matrix, is a 61 x m matrix, which 
denotes the representation of the 61 codons in the tran- 
scriptome over m conditions. This matrix thus depicts the 
demand for each tRNA at each condition. The represen- 
tation of codon i in the transcriptome at a given time/ 
condition k is defined by 



Wi 



J2 (1 - s 9 ) tGCNij (ref. 4) 
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where j is a gene, Cy depicts the number of appearances of 
codon i in gene j, and E /k indicates the mRNA abundance 
of gene j at condition or time point k. In the same manner, 
we generated 'Amino acid Expression' and 'Nucleotide 
Expression' matrices, which similarly depict the represen- 
tation of the 20 amino acid in the translated transcriptome 
or the 4 nucleotides in the transcriptome at a given time/ 
condition k. 

Calculating translational efficiency by the tAI value 

We calculated translation efficiencies of genes using the 
tAI (4). Throughout this article, we distinguish between 
translational efficiency of a gene, which corresponds to the 



where n is the number of tRNA isoacceptors that recog- 
nize the z'-th codon, tGCNy denotes the GCN of the y'-th 
tRNA that recognizes the z'-th codon, and Sy correspond to 
the wobble interaction, or selective constraint on the effi- 
ciency of the pairing between codon i and anticodon j. As 
done in the original tAI formalism by dos Reis et al. the 
absolute adaptiveness value of codon i is further divided 
by the maximum W t (termed W max ), obtaining the codon's 
relative adaptiveness value: 



Wi 



W,/W n 



The tAI value of a gene with L codons is then simply 
calculated as the geometric mean of the w/s of its codons 



tAIfe) 



n 



Based on (9) and (28) we interpret the codon adaptiveness 
values as representatives of translation speed of each 
codon. 

Exploring the balance between drift and selection on 
codon usage by a computational simulation 

We developed a computer evolutionary simulation of uni- 
cellular population of 1 000 000 haploid cells that cope 
with occasional stress periods. The genome of each cell 
consists of six archetypal genes, each of which is 
required during one or more of the simulated growth con- 
ditions (see Supplementary Material for details). We 
envisage a mapping function, such as the tAI score, that 
maps between sequence and expression level. During the 
simulation genes are mutated and as a consequence their 
expression level changes. At the beginning of the simula- 
tion, the six genes are equally scored with an initial value 
of expression level. The population then evolves at a fixed 
mutation rate of 0.001 mutations per genome per gener- 
ation. Sequences are not represented explicitly in the simu- 
lation; instead genes are characterized by an expression 
level that implicitly corresponds to a genotype. Thus, 
'mutated' expression levels at a given time step are 
computed by the previous step's expression levels mul- 
tiplied by a random number drawn from an exponential 
probability distribution of changes in expression 
[as estimated before (29)]. 

We also ran the simulation with a mode in which the 
tRNA supply is not unlimited. This mode of the run sets a 
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constant maximal total expression level from the 
'genome'. In this mode of limited supply of tRNAs, the 
expression of the z'-th gene in each generation, 
lsExpression -, ('Is' stands for limited supply) is defined by 



lsExpression ■ = 



where n is the number of genes in the cell, Expression^, is 
the expression of the i-th gene in the mutated population, 
and MaxExpression is a constant maximal total expres- 
sion level from the whole 'genome'. 

The fitness of a given cell is determined by the arith- 
metic mean of expression of the m genes which are 
required in a given environmental condition 

x _ Em Expr, 
m 

where Expr ; corresponds to either Expression^,- or 
lsExpression^,-, depending on the mode of simulation, i.e. 
if supply is limited or not. 

Our model consists of drift and selection, thus we 
propagate individuals between consecutive generations 
(/— 1) to (t) in two stages. First, a population (whose size 
can be different from that of the population at generation 
t— 1) is formed in which the z'-th genotype population 
size is given by its size in the previous generation and its 
fitness by 

XjQ) » Xi(t-V)e ki 

Then, to keep a constant population size stochastic 
rescaling is applied that implements a Kimura-governed 
(30) allele sampling as the random drift step of the 
simulation. 



RESULTS 

The supply-to-demand balance in translational efficiency 

The speed at which a codon is translated is expected to 
increase with the availability of its supply — amino 
acid-loaded tRNAs. Yet, if the codon is highly represented 
in the genome, and even more importantly, in transcrip- 
tome at a given condition, i.e. if the demand for the tRNA 
is high, then the codon's translational efficiency might be 
compromised. Translational efficiency should thus be 
modeled as a supply-to-demand process, i.e. the ratio 
between the availability of the tRNAs that translate a 
codon — the 'supply', to the extent of representation of 
that codon in the transcriptome at a given moment — the 
'demand'. The supply component is effectively captured 
by the tAI of a codon (4), originally defined as W t and 
termed codon 'adaptiveness value' (see 'Materials and 
Methods' section). The demand component is simply 
modeled as the representation of the z'-th codon in the 
transcriptome, r^(see 'Materials and Methods' section). 



Translational efficiency of the z'-th codon is the supply- 
to-demand ratio: 

^ /SD Supply, codonAdaptivenessValue, Wj 

' Demand; representation,- z-,-/ f 



In the following sections we will focus on measuring the 
dynamics of codon usage across conditions in different 
species. Note that although in the original tAI model the 
supply is constant (modeled by copy number of the tRNA 
genes in the genome) it could in future be represented as a 
dynamic entity too, accounting for the possibility that 
tRNA levels may change across conditions and cell 
types (9-12,14-17). 

mRNA expression data can be mined to deduce 
dynamics of cellular codon usage 

Conventional mRNA expression micro-arrays are typic- 
ally used to study transcription and more recently mRNA 
decay (19,31-37). Here we realized that micro-arrays may 
contain data pertinent to translational efficiency. In par- 
ticular we mine expression micro-array data to compute 
changes in the representation of the various codons in the 
transcriptome in various growth conditions in several 
model organisms. Figure 1 shows as an example the 
change in usage of the six codons coding for arginine 
during response to a DNA damaging agent in the yeast 
S. cerevisiae (see 'Materials and Methods' section). The 
representation of some of the codons increases by as 
much as 25% in stress, whereas others decrease in the 
stressed transcriptome. These changes in codon represen- 
tation might indicate a change in the demand for the 
various tRNAs, and potentially also a change in transla- 
tional efficiency of some genes in stress. Interestingly the 
summed usage of all six arginine codons hardly changes, 
and as so is the usage of the four nucleotides. 

We systemize our inspection of fluctuations of codons 
representation in the transcriptome. By multiplying the 
number of occurrences of each codon in each gene by 
the expression level of each gene in each condition we 
obtain a 'Codon-Expression' matrix which depicts the rep- 
resentation of each codon in the transcriptome in each 
condition (see 'Materials and Methods' section). 

Similarly, for further control purposes, we create 
'Amino acid Expression' and 'Nucleotide-Expression' 
matrices, which correspond, respectively, to the sum of 
codons for each amino acid or the representation of 
each nucleotide in the transcriptome at various conditions 
in a given species. The amino acid expression matrix 
allows us to ask whether changes at the codon expression 
matrix simply reflect changes in the relative appearance of 
the different amino acids at the translated transcriptome 
[as shown in some cases, c.f. (38)] whereas the nucleotide 
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Figure 1. Variations in representation of amino acids, codons and their constituent nucleotides in the transcriptome. Illustration of the case of amino 
acid arginine: shown on the y-axis is the (log 2 ) change in the representation of the six codons of this amino acid in the transcriptome during response 
to the chemical MMS. Four of the codons are increased in their representation following the stress compared with a reference condition, and two are 
diminished. This change is not accompanied by an appreciable change in the representation of arginine in the translated transcriptome, nor can it be 
reduced to a putative change in the amount of the constituent nucleotides which are shown to hardly vary during the process. 



expression matrix weighs the putative changes in the 
nucleotide composition of the transcriptome (39). 

The codon usage of the transcriptome varies upon stress 

We analyzed mRNA expression data for the yeast 
S. cerevisiae in diversity of stressful environmental condi- 
tions, and also during recovery from stress (19-21). For 
each time point in each condition we computed the 
'Codon-Expression' matrix (Figure 2). Qualitatively, the 
pattern of variation in codon usage is highly correlated 
between the various stresses and anti-correlated between 
the stresses and the stress-recovery experiments. The 
codons that are changed the most in stress show an 
increased representation of up to 40% relative to the ref- 
erence condition (Figure 2). In contrast, changes in amino 
acid representation are very minor (Supplementary Figure 
SI), and it is often the case that codons for the same amino 
acid change in opposite directions. The representation of 
the four nucleotides is also relatively constant through- 
out conditions — fold-change values vary between 
0.99-1.01 and 0.98-1.03 for codon position-independent 
and codon position-dependent usage of nucleotides, re- 
spectively (Supplementary Figure SI). Thus codon repre- 
sentation changes are not explained by a need to change 
the amino acid composition of the proteome and cannot 
be reduced to potential change in nucleotide availability. 



In turn this dynamics likely reflects changes in transla- 
tional efficiency. 

Low-efficiency codons are over-represented in the stress 
transcriptome 

We were next interested to check whether changes in 
codon representation in stress, or during recovery from 
stress, are correlated with the abundance of the corres- 
ponding tRNAs. A common simplified proxy for tRNA 
availability is the copy number of the tRNA genes in the 
genome. tRNA GCNs correlate with tRNA concentra- 
tions, at least in non-stressful conditions (9,11,40,41). 
This correlation was recently corroborated in a study 
that examined in several mammals RNA Pol III occu- 
pancy in the vicinity of tRNA genes (13). We thus 
plotted for each codon its representation in the transcrip- 
tome at various conditions along with the GCN of the 
corresponding tRNA, and in addition a more refined 
measure of tRNA availability (called codon adaptiveness 
value, Wi, see 'Materials and Methods' section) that also 
incorporates contributions to the translation of a codon 
through wobble interaction (25) from non-perfectly 
matching tRNAs (Figure 2). Interestingly the codon 
change pattern during recovery from stress correlates posi- 
tively with these two measures of tRNA availability, while 
the changes during stress showed negative correlation 
with the tRNA availability (Figure 3a, Supplementary 
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Figure 2. The fold-changes in codons representation in S. cerevisiae's transcriptome across environmental changes. Left panel — fold-changes in the 
representation of the 61 codons in response to heat shock, osmotic and MMS stresses, and when potassium chloride (KCL) and heat-shock stresses 
were removed (time is denoted in minutes). Second panel from left — fold-changes in representation of the 61 codons in yeast's transcriptome 
throughout a 15-day wine fermentation. Columns marked with 'stress' denote the fold-changes at a given stress compared with time point zero, 
before the stress was applied, whereas 'recovery' marks columns which show the fold-changes after the stress, as cells were transferred to 
non-stressful conditions, compared with the stress conditions (refers to time point 45 of the respective stress). Columns marked with 'WF represent 
the various time points (in hours) during the wine fermentation process. The matrices were normalized by dividing the fold-change values of 
individual codons at each time point to the total fold-change values across all codons. In addition the right most two columns depict for each 
codon the gene copy number of its fully matched tRNAs (tRNA's GCN), as well as its codon adaptiveness value. The codons in the matrices are 
sorted according to their fold-change increase in the stresses. As seen, codons that are increased in representation during either stress or fermentation 
have low tRNA GCN and low adaptiveness values. 



Figure S2 and legends for numerical details). This result 
means that during stress the transcriptome's codon usage 
shifts from codons that correspond to the tRNAs that are 
represented by high gene counts toward codons that cor- 
respond to tRNAs that are typically encoded by few gene 
copies. 

Stressful conditions induce a similar pattern of changes 
in codon usage in additional species 

We next wanted to check whether the tendency to increase 
the representation of codons that correspond to low gene 
copy tRNAs in stressful conditions is shared in other 
species too. We analyzed the fission yeast S. pombe 
during a short-term response to nitrogen starvation (22) 
and the worm C. elegans in response to oxidative stress 
(23). The results clearly show a consistent trend — increased 
representation in the transcriptome upon stress of codons 
that correspond to tRNAs whose GCN is low — the 
Pearson correlation between the change in codon 
representation in the transcriptome in stress and the 



availability of the corresponding tRNA is —0.59 
CP-value = 5.52 x 10" 7 ) and -0.8 (P-value = 
1.02 x 10~ 14 ), for C. elegans and S. pombe, respectively 
(Figure 3b). 

In some cases a given codon may have high tRNA avail- 
ability in one species and a low availability in another. 
We found out that such codons show increased represen- 
tation in the stressed transcriptome only in the species 
where their corresponding tRNAs are at low level. For 
instance, codon GGA (Gly) has the highest W t (codon 
adaptiveness value) in C. elegans, and relatively low 
value in S. cerevisiae, and reassuringly it shows decreased 
representation in the transcriptome of worm upon oxida- 
tive stress, whereas in yeast it is among the ten most 
elevated codons in stress. Thus the similarity of our 
trend across species does not simply result from a 
similar behavior of the same codon across species, but 
may rather reflect a commonality in which the codons 
that correspond to the rare tRNAs in each species 
respond similarly. 
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Distinct stress-specific genes induce a similar signature of 
changes in codon usage in various stresses 

Our results in 5. cerevisiae show a similar signature of codon 
usage change in stress across multiple stress types. This simi- 
larity could simply arise from a common set of general- 
stress genes that respond similarly to all stresses (32). In 
contrast we wanted to examine the possibility that distinct 
genes sets, that are each specific to each of the stresses, may 
also present the same trend. To examine this possibility we 
created distinct gene sets, each shows induction or repres- 
sion of at least 2-fold in only one of the stress conditions. 
In addition we created a 'general stress gene set' that 
consists of genes that were either induced or repressed by 
at least 2-fold in all examined stresses. We characterized 
each of these gene sets by the change in the representation 
of each of the 6 1 codons in the various stress types (con- 
sidering both the codon usage and the mRNAs levels see 
Materials and Methods for details). Then, we compared 
between the variations in codon representation of the gene 
sets across environmental changes (Figure 4). 



Interestingly we found that all stress-specific gene sets 
show highly correlated pattern of change, and that each 
of them correlated negatively with tRNA availability in 
terms of codons' tAI values (Pearson correlations 
between —0.62 and —0.73; ^-values << 0.05). In 
contrast, the general stress genes have a different signature 
that does not correlate significantly with tRNA availabil- 
ity (Pearson correlations between —0.22 and 0.13). This 
analysis indicates that in each stress type distinct genes 
converge upon the same pattern of increased representa- 
tion of codons that correspond to low GCN tRNAs. 
On the other hand, the general stress genes are actually 
better adapted to the high gene copy tRNAs. 

Why are not all genes codon-optimized? A potential 
effect of genetic drift and a limited tRNA supply 

An intriguing question is why did not evolution opti- 
mize the codon usage of all genes in a genome? We 
hypothesize that at least three factors may account for 
this situation: (i) Drift versus selection balance: drift 
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Figure 3. Correlation between variation in codons' usage and their translational efficiency across environmental changes. Each panel denotes the 
correlation between the adaptiveness values (Wj) of the 61 codons and the fold-changes in their representation in the transcriptome (a) Left panel — 
response to osmotic stress in S. cerevisiae; Pearson correlation = —0.66, P-value = 5.37 x 10~ 9 . Right panel — recovery from osmotic stress; Pearson 
correlation = 0.69, P-value = 1.01 x 10~ 9 . (b) Left panel — a short-term response to nitrogen starvation in S. pomhe. Pearson correlation = —0.8, 
P-values = 1.02 x 10 -1 ; Right panel — response to oxidative stress in C. elegans, Pearson correlation = —0.59, P-value = 5.52 x 10~ 7 . 
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Figure 3. Continued. 



may erode codon optimization of genes, thus counteract- 
ing the force of purifying selection and codon optimiza- 
tion in particular. Intuitively, while drift should constantly 
act on all genes at all times, selection should mainly act on 
a gene during evolutionary time periods and conditions in 
which it is needed. Thus genes that are expressed rarely 
during the life-history would experience compromised 
selection-to-drift ratio, (ii) Limitation in the supply: if 
the pool of amino acid-loaded tRNAs is not in great 
excess (see Discussion) and all genes were biased toward 
codons that correspond to abundant tRNAs then the 
demand for such tRNAs might be too high and transla- 
tion would not be efficient even in genes in which high 
translational efficiency is most needed and thus expressed. 
Hence some genes may need to 'give way' to others, (hi) In 
addition, it is entirely possible that some genes may 
actually be needed in low level of expression and are 
thus deliberately not codon-optimized (42,43). 

Focusing on testing the first two scenarios we 
constructed an evolutionary dynamics computer simula- 
tion to test the effect of drift-to-selection ratio and of 
demand-to-supply ratio on codon optimization levels of 
genes. Our simulated model consists of a constant-size 
population of 1 000 000 cells, each possess a genotype 



of 6 archetypal genes — a house-keeping gene which is 
expressed in every environment and growth condition, a 
'good-life' gene which is expressed only at favorable 
growth conditions, three 'stress-specific' genes, each 
uniquely associated with one out of three different stress 
types, and a 'general-stress' gene, which is expressed in any 
of the stress types, but not during the favorable growth 
condition. The population experiences changes in the en- 
vironmental conditions that could be either stressful or 
optimal. Stresses come in three different types. The differ- 
ent genes are either expressed or not depending on the 
prevailing conditions at a given moment. During cell 
doublings mutations are randomly seeded in the 
genome. These mutations are modeled here as changing 
codon usage and we refer to them by their effect on ex- 
pression levels. The fitness of each cell in a given condition 
is determined by the expression level of all the genes that 
are needed at that condition, and growth rate is propor- 
tional to fitness (see 'Materials and Methods' section for 
further details). For example, under a particular stress the 
fitness would be affected by the expression level of the 
gene that is needed specifically at that condition, and 
also by the expression level of the 'house-keeping' and 
general-stress genes. Thus each gene is subject to the 
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Figure 4. Clustering of codon usage profiles of general stress genes and stress-specific genes. This analysis compared between the codon usage 
profiles of various sets of genes that are characterized by their mRNA expression pattern across environmental changes. The fold-changes in the 
codon usage at each time point of a given stress were calculated separately for the stress specific and the general stress genes. In addition the 
codon-tAI values ( Wt) are shown. Each column and row corresponds to one time point in a particular stress. The name of each codon set consists of 
the condition, followed by time point and a 'S' or 'G' designation indicating whether it was derived from the genes that were specifically changed at 
that stress or the general stress responsive genes, respectively. Hierarchical clustering was performed with 1 -Pearson correlation as a distance metric 
and 'average linkage'. 



effect of drift throughout the simulation, but selection is 
acting upon it only during times in which the environment 
is presenting the conditions that require it. We ran the 
simulation for 10000 generations (see 'Materials and 
Methods' section) and followed the extent of expression 
level of the various genes. We ran the simulation in two 
modes. In the first mode, there was no bound on the total 
expression level from all the genes in the genome. This 
mode simulates a situation in which the tRNA supply 
was not limiting and only drift limits expression of 
genes. In the second mode of the simulation the supply 
of the tRNA is limited so that not all genes can be 
optimized simultaneously. Hence, whereas in the first 
mode only drift can compromise expression level of 
genes, in the second one it was a combination of drift 
and limited tRNA supply that could act together in 
limiting expression levels of genes. For each mode, we 
applied three different environmental regimes, in which 
the total duration of stressful conditions constitutes 20, 
50 or 80% of the total evolutionary time. 

The results of the simulation, under the two modes 
(Figure 5a) show that on average, the 'house-keeping' 
gene has the highest expression level, followed by either 



the good-life gene or general stress gene, depending on the 
fraction of evolutionary time that the population spent in 
stressful or non-stressful conditions. Regardless of the 
duration of stressful or non-stressful conditions, the 
stress-specific genes show the lowest expression level. 
The differences are observed when drift alone limits 
genes expression and it becomes even more pronounced 
when imposing a limitation on the tRNA supply. 

How well does the simulation predict differences in 
codon optimization between house-keeping, stress- 
specific, general-stress and 'good-life' genes in the 
genome? To test the simulation we examined translational 
efficiency, by the tAI measure in the following 5. cerevisiae 
gene sets, defined based on micro-array expression levels: 
(i) 'house-keeping genes', defined here as genes that 
maintain constant expression level in all conditions 
(maximal absolute change in expression of 15%); (ii) 
'good-life' genes which are repressed under stressful con- 
ditions; (iii) general stress genes which are induced in every 
stress; and (iv) stress-specific genes which are induced in 
exactly one of the examined stresses (set (iii) and (iv) 
defined as mentioned above). Figure 5b shows the mean 
tAI ('Materials and Methods' section) values of the genes 
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Figure 5. Translational efficiency of different gene sets: comparison of genome data and simulations, (a) Measurements of translational efficiency for 
environmental-dependent gene sets by computational simulation. We simulated population of cells, each possessing a genotype of six genes — a 
house-keeping gene (green) which is expressed constitutively, a 'good-life' gene (purple), which is expressed only at favorable growth conditions, three 
stress-specific genes, each expressed in only one of the stress conditions (represented by their average, colored in gray), and, a 'general stress' gene 
(blue), which is expressed in all of the stress types. Shown for each case is the averaged expression level across the entire simulated evolutionary time, 
each average, in turn, represents a mean of 50 independent runs of the simulation. The upper panel displays a simulation mode in which only drift 
limits expression of genes, whereas in the lower panel the tRNA supply too is limited so that not all genes can be optimized simultaneously. The 
three blocks of bars represent three different environmental regimes that differ in the percentage of evolutionary time in which the population was 
exposed to stress, (b) Measurements of averaged gene tAI values for various sets of genes from the S. cerevisiae genome. The bars show the mean tAI 
value for house keeping genes (green), 'good-life' genes, defined to be the stress-repressed genes (purple), general stress genes (blue) and stress-specific 
genes, i.e. the union of the gene sets that are specific to each of the stresses (gray). 



in each of the four gene sets. The results show a clear rank 
with significant differences between the gene sets: house- 
keeping genes show the highest tAI values, followed by the 
'good-life' genes, followed by the general stress genes, and 
ending with the stress-specific genes who show the lowest 
values (all differences are significant /"-value < 0.05, 
f-test). These results are in good agreement with the simu- 
lation and they suggest that genes that are rarely used 
during the life-history would be poorly optimized and 
display low expression levels. 

DISCUSSION 

Traditional measures of translational efficiency either 
consider the codon usage pattern of the coding sequence 
alone, or additionally weigh the tRNA pool. Yet, even if 
the tRNA supply is explicitly taken into consideration, 
translational efficiency should be further evaluated by 
the actual consumption of tRNAs. Thus, we suggest 
here that translational efficiency should be thought as a 
demand versus supply model, in which the supply is given 
by the tRNAs availability, and the demand is captured by 



the representation of the 'consumers 1 — the codons — in the 
transcriptome. We show here that the demand varies 
across conditions, and we anticipate that future investiga- 
tions will strengthen the notion that the supply too is not 
constant either in time or between cell types and tissues 
(10,13). 

Our comprehensive model of translational efficiency is 
based on analysis of the supply-to-demand of the various 
tRNAs that translate each gene (Figure 6). Both demand 
and supply may be constant or change across conditions, 
cell types and developmental stages. Thus, our model of 
translation efficiency challenges the prevailing simplifying 
assumption that translation efficiency of a given gene is 
constant throughout organism life. In turn, our model 
implies for dynamic range of translation efficiency, sug- 
gesting that the interplay between tRNAs availability and 
codons representation play a role in shaping expression 
levels of individual genes throughout organism life. 

Recently, the change in tRNA synthesis was measured 
across organs and species in several mammals (13) and 
was shown to be relatively constant within sets of 
tRNAs that share an anticodon. Such technology may 
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Figure 6. Putative dynamics of supply and demand for tRNAs and its implication on translational efficiency. The figure illustrates five potential 
modes of control of changes in the tRNAs supply (blue) and demand (red). In the various regimes the supply and demand might both be constant, or 
either of them, or both of them might change dynamically. Corresponding to each mode of control is the expected effect on translational efficiency 
(bottom, green). 



reveal potential changes in the tRNA pool in micro- 
organisms when they respond to different conditions. 
A change or lack of a change in the tRNA supply would 
affect translation provided that the tRNA is in limiting 
amounts. Utilizing published estimations on the 
amount of tRNAs molecules and the translated portion 
of mRNAs in yeast, we calculated (Supplementary 
Material) a tRNAs-to-codons ratio for codons. Our cal- 
culations, although rough, suggest that tRNAs and their 
respective codons are present in the cell in the same order 
of magnitudes of copy numbers. This result implies that 
there is no appreciable surplus of tRNAs which could 
buffer changes in the demand without a change in the 
basal tRNA levels. 

A main observation of this study is that the stress tran- 
scriptome is poorly adapted to the constant tRNA pool. 
The challenge is thus to explain this somewhat 
non-intuitive finding. Clearly a fitness advantage could 
be gained from better adapted stress codon usage. Yet, 
as we suggested above, a force that counteracts adaptation 
is genetic drift. Using a computational simulation, we 
demonstrate that the balance between drift and selection 
on an evolutionary time scale may explain the low adap- 
tation of stress-specific genes. Real genomes too have been 
shown to follow such logic, whereby genes that are needed 
infrequently in the ecology or life-style of a given species 
remain poorly adapted to its tRNA pool (5,17,44,45). For 
these genes the eroding effect of drift prohibits optimal 
adaptation. This scenario is consistent with case no. 2 in 
Figure 6, which discusses the possibility that the tRNA 
pool does not change to match, or counteract a change 
in codon usage of the transcriptome. However, the simu- 
lation results of course do not exclude the possibility that 
the tRNA pool does change in a condition-dependent 



manner. Indeed our recent observation actually indicate 
in that direction (9) as they show that during diauxic shift 
in yeast the tRNA pool does change to a minor extent. 
Future experimental efforts along these lines would be 
needed to establish the possibility that the tRNA pool 
changes dynamically potentially to off-set changes at the 
demand. 
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